A Bayesian Bernoulli-Exponential joint model for binary longitudinal outcomes and informative time with applications to bladder cancer recurrence data

Background A variety of methods exist for the analysis of longitudinal data, many of which are characterized with the assumption of fixed visit time points for study individuals. This, however is not always a tenable assumption. Phenomenon that alter subject visit patterns such as adverse events due to investigative treatment administered, travel or any other emergencies may result in unbalanced data and varying individual visit time points. Visit times can be considered informative, because subsequent or current subject outcomes can change or be adapted due to previous subject outcomes. Methods In this paper, a Bayesian Bernoulli-Exponential model for analyzing joint binary outcomes and exponentially distributed informative visit times is developed. Via statistical simulations, the influence of controlled variations in visit patterns, prior and sample size schemes on model performance is assessed. As an application example, the proposed model is applied to a Bladder Cancer Recurrence data. Results and conclusions Results from the simulation analysis indicated that the Bayesian Bernoulli-Exponential joint model converged in stationarity, and performed relatively better for small to medium sample size scenarios with less varying time sequences regardless of the choice of prior. In larger samples, the model performed better for less varying time sequences. This model’s application to the bladder cancer data showed a statistically significant effect of prior tumor recurrence on the probability of subsequent recurrences.


Introduction
Longitudinal data entail observations collected repeatedly on subjects over time.In medical research, the collection of correlated, longitudinal data is a common phenomenon.Ranging from the assessment of response changes and trends over time to understanding disease progression, the benefits longitudinal approaches are enormous [1,2].A defining feature of longitudinal data is the dependency that characterizes observations extending over time, the type of outcome measured and sometimes, the assumption of fixed time measurements for subjects [3][4][5].The broad assumption of fixed time measurements, predetermined by study design, however is not always a tenable assumption.For instance, in a clinical trial, there is the potential for different visit mechanisms.Study subjects are likely to miss scheduled visits, and a proportion of them are prone to adverse events from investigative treatments.Also, due to poor health conditions, individuals may self elect to visit the investigative site or hospital more intensely than their study counterparts.These occurrences may result not just in unbalanced data for subjects, but also varying visit profiles.Thus, the time structure adopted for the study can be considered informative.In a broad sense, this indicates that outcomes measured at subsequent time points are influenced or can be adapted based on outcomes measured in current time.This necessitates the use of advanced methods that address the informative time structure rather than standard, traditional approaches, which are limited by the assumption of fixed time.To handle such scenarios, Bronsert [6] developed a classical joint model, involving Gaussian outcomes and exponentially distributed informative time.Later, Alomair [7] extended Bronsert's model to include time dependent covariates.Classical informative time joint models have also been developed by Seo [8], involving longitudinal outcomes from the exponential families and exponentially distributed informative time.These joint models used the maximum likelihood estimation approach for estimating model parameters, and the authors broadly discussed associated computational complexities.
A Bayesian technique for modeling joint longitudinal outcomes and informative time points has been developed by Zaagan [9] but only for Gaussian distributed outcomes.The objectives of this research paper are twofold.First, we develop a Bayesian joint model for analyzing binary longitudinal outcomes and informative times.Then, via statistical simulations, we examine the influence of controlled variations in subject visit patterns, different prior specifications and sample size schemes on the proposed model.This proceeds with model convergence assessment and model evaluation.The proposed Bayesian-Exponential joint model is applied to a Bladder cancer recurrence data resulting from a clinical trial involving patients with bladder cancer conducted by the Veterans Administration Co-operative Urological Research Group (VACURG) [10,11].

The Bayesian Bernoulli-Exponential joint model formulation and likelihood specification
The exponential family of distributions covers a broad range of response distributions including Gaussian and Non-Gaussian distributions [12,13].For example, the Normal, Gamma, Poisson, Bernoulli, and Beta distributions are a part of the parametric set of distributions included in the family.Suppose the observations y 1 , y 2 , y 3 , • • • , y n are independent observations of a response variable, the exponential family of distributions from which the independent observations are sampled, can be specified as Where, • θ i represents the canonical parameter.
• φ is a scale parameter and m i (•), s(•) , and r(•) are known functions which relates to the variances of distributions in the exponential family.• m i (φ) can be specified as m i (φ) = φ u i , and u i 's are predetermined weights.
The canonical or location parameter characterizes a so called canonical link function, and relates to the means of the distributions in the exponential family.
Assume we have a set of n participants enrolled in a clinical trial, have to visit an investigative site over time and are followed over an interval from (0, τ ] .A response observation for the ith participant measured at the kth visit time point can be specified as y ik .We can further specify vectors of individual responses and their associated visit time points as Here, the subscript n i allows for varying participant visit times.We can thus specify the joint distribution of recorded responses and time points as where is a vector of unknown parameters to be estimated.Using these ideas, and in line with Seo [8] we can further specify a model that incorporates the joint distribution of responses and time points y ik and t i n with the underlying assumption that the current response depends on the one-step prior response y ik−1 , and cur- rent visit time point (t ik ) .It is important to note, how- ever, that subsequent responses, y ik will not be solely conditioned on observation time, t ik but also on the most recent prior response, y ik−1 and observation time.This distribution can be specified as; (1) This formulation forms the premise for specifying the joint model with response observations sampled from the Bernoulli distribution.Time is considered informative and assumed to be exponentially distributed.The joint distribution for binary longitudinal outcomes and informative time given the underlying assumption of a one step dependency can be specified as; Note that, µ ik = E(Y ik ) = P(Y ik = 1).More specifically for the Bernoulli distribution the link function can be specified as a logit link which in the context of this study can be expressed as; Furthermore, the specified mean function for the initial value for the ith participant and that after the initial value can be expressed as respectively.Hence, our final model specification for the parametric joint Bernoulli-Exponential model can be expressed as; (3) (4) (5) Where, • α is a vector of regression parameters denoting the effect of covariates on observed responses.• ψ represents the effect of the prior responses on aver- age current responses.• ϑ represents the effect of current response time on the mean responses, • ξ is a constant parameter associated with time • γ characterizes the effect of previous response on mean time and X is the design matrix.
The resulting likelihood function, a product of the density functions for s subjects, can be specified as, It is further important to clarify, that one key underlying assumption of this model, following Lin and Ying [14],Lin, Scharfstein, and Rosenheck [15], Liang, Wenbin (8) L �, y 1 , y 2 , and Zhiliang [16] and Sun, Sun, and Liu [17], is that censoring time, Z i in this study is noninformative in the sense that given covariates (X i ), Z i is independent of the observation times {t ik , k ≥ 1} and longitudinal outcomes Y i (•) .This basically means that given the covariate history up to time k, the distribution of the future covariate path up to any time t > k is independent of whether or not there is an observation on X i at time k.

Specification of priors
After the likelihood function of the Bernoulli-Exponential joint model distribution has been specified, the next step in the Bayesian model specification is the identification of a suitable prior.In this study, informative and non-informative priors are considered.Both priors serve important roles in Bayesian analysis, and the choice between them depends on the specific goals and available information in a given analysis [18].Noninformative priors, also known as weak,vague or diffuse priors, are designed to have minimal influence on the posterior distribution.They can make Bayesian analysis robust to situations where there is little prior information or when prior beliefs are uncertain.They prevent strong prior assumptions from biasing results when there is limited prior knowledge [19].One of the primary benefits of informative priors, on the other hand, is that they allow to incorporate expert domain knowledge and prior information into the analysis [20,21].This is invaluable when experts have insights that can improve parameter estimation, and, in situations with limited or noisy data, informative priors can lead to more stable and accurate parameter estimates.Finally, informative priors explicitly quantify prior beliefs and uncertainty, which allows to integrate these beliefs with observed data.In this study, for both informative and non-informative prior scenarios, we consider the vector of mean parameters (α) as having a multivariate normal distribution [19,[22][23][24].This is specified as; Furthermore, we consider the parameters associated with time or visit to similarly follow a Gaussian distribution; Note that the prior distributions of our joint model parameters are considered independent and thus, For the informative prior setting, fixed values for the prior means, (µ α , µ ϑ , µ ψ , µ ξ , µ ω ) and their correspond- ing variances ( α , ν ϑ , ν ψ , ν ξ , ν ω ) are adopted, since we do not have expert or historical estimates yet for these kind of studies.More specifically, we can denote the mean vector of α , µ α with a prior mean vector and correspond- ing covariance matrix as; where I s represents an identity matrix whose dimension depends on s individuals and φ .More broadly, we set pre- determined prior mean values for the visit parameters as; and their corresponding prior variances as Regarding the non-informative prior setting, two approaches are considered.First, Gaussian non-informative priors are adopted for all mean and variance parameters of both the response and time parameters.More (9) broadly, to express prior ignorance, the prior means (µ α , µ ϑ , µ ψ , µ ξ , µ ω ) are set to zero and the variance- covariance for φ α can be set as a diagonal matrix with large variance.Similarly the corresponding prior variances for the other parameters are set very large to express prior ignorance.Thus, the non-informative priors are set up as, For the second case of non-informative prior, we consider the Jeffreys prior [25] an appealing reference prior widely used in Bayesian inference.This prior is considered for the response/outcome parameters and Gaussian noninformative priors are still considered in this study for visit parameters.The Jeffreys prior is obtained by applying the Jeffreys rule which defines the prior density to be directly proportional to the square root of the determinant of the Fisher information matrix.That is, for a set of parameters θ = (θ 1 , . . ., θ n ) , the Jeffreys prior is given by, The Fisher information matrix is defined by, and L is the likelihood function that specifies the probability for data y given the parameters θ .It is appropri- ate so far as I(θ) is positive definite.Aside its geometric interpretation, one of the appealing reasons for its usage is the concept of parameterization invariance [26].This means that the prior is invariant with regards to one-toone transformations.The principle can be extended for multidimensional parameters.To establish ideas for the Jeffreys prior for response parameters, which result from the exponential family of distributions, the likelihood functions of the distributions and associated score vectors need to be specified.
Let φ i 's be known and X ′ assume a rank q.Also let, θ i = z x ′ i α and m −1 (φ i ) = φ −1 w .The likelihood func- tion for Generalized linear models with responses from the exponential family of distributions can generally be specified as; (11) The score vector is represented by; The resulting Fisher information matrix is specified as; Here, ∂η i and is an adjustment for the link function.
The Jeffreys prior thus for α assuming φ is known, is specified as Based on this derivation, Jeffreys non-informative prior considered for response parameters and Gaussian noninformative priors maintained for the visit parameters can be specified as;

Posterior distribution specification and Bayesian joint parameter estimation
The next step in the Bayesian model development is the specification of the posterior distribution, which has a directly proportional relationship with the model likelihood and the priors specified.For the scenario where Gaussian priors are considered for both the response and visit parameters and also for both informative and non informative settings, the resulting Bayesian Bernoulli-Exponential joint model posterior specification can be obtained as; (12) Also for the scenario where Jeffreys priors are considered for the parameters of the Bernoulli response and Gaussian priors for the visit parameters (non informative settings), the resulting Bayesian Bernoulli-Exponential joint model can be parameterized as; The next goal is to obtain posterior summary estimates for inference.Analytical calculations of the posterior distributions are possible, but often untenable due to laborious calculations involving the integration constant.Integral approximation methods can be adopted but only if few parameters are involved [19,24].In situations such as this study involving many parameters to be estimated, one can resort to Markov Chain Monte Carlo Methods (MCMC) [27].The MCMC methods are viable (17) simulation approaches for sampling from posterior distributions and computing posterior summary measures.
They are premised on a Markov Chain construction that subsequently converges to a so-called target distribution.
The two most popular MCMC methods are the Gibbs sampling and the Metropolis-Hastings algorithm [27][28][29].In this study, we adopt the Gibbs sampling procedure for generating samples from the joint posterior distributions of the unknown parameters in our model.
It is important to clarify, however, that the Gibbs sampler, performs iterative draws from posterior conditional distributions instead of directly sampling from the joint posterior distribution.This approach enhances the utility of the Gibbs Sampler, especially when dealing with complex joint posteriors that can be challenging to handle directly.Then, posterior summaries can be computed.In each step of the algorithm, random values are generated from unidimensional distributions [30].A brief summary of the Gibbs sampling algorithm is as follows; (a) Predetermined initial values θ (0) need to be specified.(b) For t = 1, . . ., T iterations, 1 . . ., θ r , then we can generate the new parameters by, are known as the full, complete or conditional distributions.Summarily, the Gibbs sampling algorithm helps to iteratively generate samples from our posterior distribution based on prespecified starting values.Initial portions of the Markov chains are discarded in an attempt to mask the influence of initial values.This is called the burn-in part.Resulting posterior summary measures such as the posterior mean, posterior standard deviation and Bayesian credible intervals are obtained from the MCMC output.Furthermore, we assess convergence of the Markov chains via the diagnosis of ergodic mean plots of estimated parameters and the Heidelberger and Welch diagnostic test which is a more formal convergence diagnostic method [31].

Model evaluation
To assess the Bayesian Bernoulli-Exponential joint model, the Bayesian model evaluation criteria called the Deviance Information Criterion (DIC) is used [32].The DIC measure comprises a "goodness of fit" and "complexity" term and is obtained as; where D(θ ) is the deviance calculated at the posterior mean of the parameters and p D characterizes the "effec- tive" number of parameters relating the complexity of the models.p D is the difference between the posterior mean deviance, D(θ ) and deviance calculated at the posterior mean of the parameters, D(θ ) .Smaller values of DIC jus- tify a better fit of the model.In line with this derivation, the DIC measure for the Bayesian Bernoulli-Exponential model is specified as;

Simulation study
In order to assess the Bayesian Bernoulli-Exponential model in terms of how it can be influenced by controlled variations in sample size, visit schema and types of prior distributions on the parameter estimates we present in this subsection, a simulation study.More precisely, the simulation study helps establish the validity of the joint model in random scenarios via data generation and parameter estimation.It is important to clarify, however, that this present study is an extension of the studies of Bronsert [6], Lin [33], Seo [8] and Zaagan [9] and thus for computational convenience, an abundant level of consistency is maintained in terms of simulation conditions.All simulations are performed in R software via the R2Open-Bugs package.This package provides a means to program Bayesian models in R via an OpenBugs software [34,35].
To develop the Bayesian joint model, the structure of the data to be simulated is clearly defined.We simulate data involving two categorical variables, each having three The visit times for each of the corresponding responses are simulated from an exponential distribution.Furthermore, we simulate design structures that consider varying visit schemes and sample sizes to effectively study trends or patterns associated with the model.In this study, three varying sample sizes with four sub design visit structures entailing both balanced and unbalanced visit structures are considered and shown in Table 2. Also, three prior schemes are considered, that is Gaussian informative, Gaussian noninformative and Jeffreys non-informative priors.
Thus, the simulation matrix involves three varying sample size designs, three varying prior schemes and three visit design structures.To further clarify the visit structure, as an example to signal an unbalanced visit pattern, when the sample size is 180 and the number of observations is 20 & 6 , this exemplifies 90 participants having 20 recorded observations and another 90 subjects have 6 measured outcomes each.This simulation design scheme results in 27 differing designs for the simulation analysis of the Bayesian Bernoulli-Exponential joint model.
After data generation, the simulation analysis involves estimating the joint model parameters via the package R2Openbugs in R software.It commences by first "sinking" in generated parameter values which that serve as initial values for the MCMC estimation process.Then, the likelihood of the Bayesian joint model is calculated based on the design structures and priors specified.Parameter estimation proceeds with the Gibbs Sampling approach, which has earlier been discussed.This generates dependent Markov chains for our model parameters by drawing samples from the posterior distribution using initial parameter values that were embedded in the simulation design.Markov chains are run iteratively 30,000 times, and the first 10,000 iterations are discarded to serve as burn-in, effectively mitigating the influence of the initial values.Thinning intervals of three iterations are considered to monitor autocorrelations of the generated values.Subsequently, to monitor convergence of Markov chains and their associated posterior parameters, the Heidelberger and Welch convergence tests are computed.Then, posterior summaries such as the mean, standard deviation, and credible interval limits are presented.It is instructive to note that the simulations were replicated a 1000 times and inferences were premised on the averaged estimates and associated credible intervals.Finally, inferences via comparisons for different specification of the prior distribution and their sample size and visit design schemes for the model are made along with Deviance Information Criterion measures.

Simulation results: model convergence assessment of the Bayesian Bernoulli-Exponential joint model
To evaluate convergence of the Markov chains of the model parameters, a formal diagnostic test, called the Heidelberger and Welch test [31] is used.It is expected that after the burn-in period, the Gibbs Sampling algorithm produces samples from the posterior distribution that attains a stationary distribution.The Heidelberger and Welch test constitutes a stationary and half-width

Simulation results: parameter estimation and evaluation of the Bayesian Bernoulli-Exponential model
In this section, the influence of controlled variations in sample size, visit sequences and type of prior distributions on the estimated parameters of the Bayesian Bernoulli-Exponential model are examined.Consistency in the direction of these estimates and their associated credible intervals are checked.For ease of reporting, we present a select number of results from the various simulation scenarios.Posterior means, standard deviations and credible intervals of select scenarios are presented in Tables 6, 7, 8, 9 and 10.
Fixing sample sizes and priors across scenarios and examining the effect of varying sequences on parameter estimates, a consistent trend in magnitude and direction of the estimates and their log-transformation were observed across all scenarios.For example, the parameter estimates of results obtained from the model when sample size and time sequence 54 (10) and 54(20&6) , 18 (10) and 18(5&3) , 180 (10) and 180(5&3) under informative prior scheme were not markedly different in terms of their magnitude and direction.As an example, the posterior means and standard deviations obtained for the model scenario, sample size and visit scheme 180 (10) under This further indicates that varying time sequences do not considerably affect the resulting estimates.Examining the credible interval(CI) widths under the different schemes reveal an interesting trend.As the sample sizes across all scenarios increased, albeit keeping priors and time sequences constant, the CI widths were increasingly narrow, implying that when our proposed model is applied to datasets of increasing sample sizes, the resulting estimates are obtained with higher precision.For instance, as an example, we compare parameter estimates and their CI widths under a select Gaussian non-informative prior scenario for these model scenarios 18 (10), 54 (10) and 180 (10) (see Table 11).The trend observed from the presented estimates are quite obvious; increasing sample sizes applied to the proposed Bayesian

Simulation results: evaluation of the Bayesian Bernoulli-Exponential model
Finally, model performance is evaluated under the various simulation scenarios via the Deviance Information Criterion (DIC).Since there are a lot of DIC values computed for varying scenarios, they are presented graphically for ease of evaluation and clarity.The DIC plots of the selected simulation scenarios applied to the model are presented in Figs. 1, 2, 3, 4, 5, 6, 7 and 8.
First, we fix sample sizes and compare how the model performs across the type of prior and visit sequence.Regardless of the kind of prior chosen for the model parameters, it is observed in Fig. 1 that in the smallest sample considered, 18, the model performs better for the time sequence 5&3 , reflected by lower DIC values across all prior scenarios.This is followed by the balanced time sequence, 10.In fact, there's no marked difference between the DIC value of the time sequence 5&3(599.8)and 10(628.4)when considering the Jeffreys prior and fixing the sample size at 18.This trend is consistently observed, even when the sample sizes are fixed  vary for the sample size 180, regardless of the prior chosen for sequence 5&3 and 20&6 .The results for the visit sequence 10 are quite consistent with 5&3 when compared.Models perform better in small sample size 18 scenarios as reflected by their lower DIC values, followed by 54.
The DIC values for sample size 54 and 180, however are close when the Jeffreys prior is considered for time sequence 10.Overall, model evaluation of the Bayesian Bernoulli-Exponential Model suggest a relatively better fit for small and medium sample size scenarios (18 and 54) with less varying time sequences (5 &3) and (10), regardless of prior choice.For larger samples (180), the models performs fairly well for less varying time sequences (5 &3) but not significantly so for time sequences (20& 6) regardless of the choice of prior.

A model application to bladder cancer recurrence data
In this section, the proposed Bayesian Joint Bernoulli-Exponential model is applied to a real-world dataset, called the Bladder Cancer Data.This data is openly available in R software, specifically in the "Survival" pack- age [36] and results from a clinical trial on patients with bladder cancer conducted by the Veterans Administration Co-operative Urological Research Group (VACURG) [10,11].The bladder cancer dataset in R software com- prises information on 85 subjects, measured four times, with randomly assigned treatments of only thiotepa or a placebo.38 patients are assigned to the placebo group and 47 to the treatment(thiotepa) group.Data on patient experienced number of recurrences are collected including the number of initial tumours present pre-trial randomization.Other variables include "stop", which measures the time interval in months since the last visit.The next scheduled visit is dependent on bladder tumor recurrence at the time of measurement, indicating that time can be considered informative, and that subsequent visits are likely be influenced by previous visits.Also, the intensity of visits depend on tumor recurrences.Furthermore, there is an "event" variable, which is a binary variable representing the recurrence of tumor( 1) or (0) for non-recurrence attributable to reasons like death.The variables along with their description are given in Table 12 below.This data is analyzed with the following objectives in mind.Is there an effect of treatment type, size in centimeters(cm) of the largest initial tumor, initial number of tumors on the likelihood of tumor recurrence?Furthermore, is there an effect of prior recurrences(outcomes) on the likelihood of current recurrence?To answer these research questions, our proposed Bayesian Bernoulli-Exponential Joint model is fitted to the data.The binary "event" variable is used as the response and the predictors included in the model are treatment type, size in cm of the largest initial tumor, initial number of tumors and other time variables.Just as previously discussed in the Data and methods section, the Bayesian model involves the specification of a joint likelihood, priors and then the posterior distribution.
Here, three types of priors are considered and compared across the models.In this regard, the non-informative Gaussian priors considered for this model is, The Gaussian Informative priors considered for this model is, Furthermore, we consider Jeffreys non-informative priors for the α parameters and Gaussian non-informative priors for the visit parameters.The resulting posterior distribution of the Bayesian Bernoulli-Exponential Joint model for the bladder cancer data, for the instance where the Jeffreys prior considered for the parameters of the Bernoulli response process and Gaussian priors for the visit parameters in non informative settings is considered is;    Here, V (α) = diag(v 1 , v 2 . . ., v n ) and v i = µ ik (1 − µ ik ) .and, • α s are regression parameters representing the effect of the predictors; treatment type(x 2 ), initial number of tumors, ( x 3 ) and size in (cm)(x 4 ) of the largest initial tumor on the likelihood of tumor recurrence.
• ψ represents the effect of the prior recurrence on the mean response of the current recurrence and ϑ characterizes the effect of current recurrence time on the mean recurrence, • ξ is a constant parameter associated with time and γ is the effect of the previous recurrence on the mean time.
Other components are already explained thoroughly in the Data and methods section.Note that the posterior distribution changes when the priors change in the Gaussian and non-Gaussian settings considered for all parameters.Then, after the posterior specification, we proceed with the joint parameter estimation with the Gibbs sampling approach in R software.For each of the three prior scenarios considered, the Markov chains are run iteratively 30,000 times, and the first 10,000 iterations are discarded to serve as burn-in.are presented in Table 13.Inferring from the tests conducted, no issues were observed with the convergence of the MCMC chains.Overall, we can proceed with posterior summary inference with precision since the MCMC chains are in a stationary distribution.
After convergence assessment of the model, inference based on the posterior summary measures is the next step.Posterior means, standard deviations and associated credible intervals of the prior scenarios are presented in Table 14 along with their corresponding DIC's.The best model is chosen based on the least DIC value.Observing the results, the model under the Jeffreys non-informative prior, yielded the least DIC (1108) value.Ergo, parameter inference is based on the Bayesian Bernoulli-Exponential model with Jeffreys prior specified.The results demonstrate that the effect of treatment type is statistically significant on the likelihood of cancer recurrence inferring from its credible interval α 2 = 0.216 (0.232, 0.411).The initial number of tumors have a significant effect α 3 = 0.036 (0.001, 0.108) on the likelihood of cancer recurrence and hence a significant prognostic factor.Furthermore, the size in cm of the largest tumor has a significant marker on the likelihood of cancer recurrence.Afterwards, the time parameters are observed.The effect of prior tumor recurrence on the mean response of current tumor recurrence, represented by ψ is statistically significant −0.408(−1.009,−0.135) , indicating that pre- vious tumor recurrences influence the probability of subsequent recurrences.Additionally, the effect of current recurrence time(ϑ ) is significant on average recurrence, reflected by the estimated probability (0.157) (0.018, 0.337).

Discussions and conclusions
Broad assumptions underlie the usage of longitudinal analysis approaches, ranging from univariate designs to the even the most complex conditional and marginal modeling approaches.One of the common assumptions, albeit implausible in certain scenarios, is the supposition that time is always fixed and predetermined by statistical design.Phenomenons may alter the time trajectory of study subjects, like sickness or adverse events in clinical trials, which may result in not only irregular time points for subjects, but also imbalanced data and differing visit intensities.This implies current visit outcomes being informative to subsequent ones.It is also important to emphasize that the issue of informative censoring may be less problematic in the context of an informative time/schedule designs, given the assumed observation schedule protocols.In simpler terms, individuals with more severe conditions requiring early interventions or treatments, which could lead to informative censoring, would also have shorter observation schedules and, consequently, more "frequent" measurements.This assumption underlies the simulation design for this study.In this article, we have developed a Bayesian joint model for longitudinal outcomes from the exponential family of distributions with particular emphasis on Bernoulli distributed longitudinal outcomes and exponentially distributed informative time points.An assessment of the influence of controlled sample size scenarios, visit and prior specification schemes on the estimated parameters of the proposed Bayesian Bernoulli-Exponential joint model was performed via simulations and was evaluated based on Deviance Information Criteria.The methods commenced with specifying likelihoods for the joint outcome and time distributions, specification of priors, and then a discussion on the Markov Chain Monte Carlo Approach for estimating posterior parameters.The priors considered were Gaussian informative priors, Gaussian non-informative priors and Jeffreys non-informative priors.Convergence analysis was performed with the Heilderberg and Welch Test.Once the models converged, posterior inference followed and models were evaluated based on Deviance Information Criteria.Inference from the Heidelberger and Welch Tests conducted across selected simulation scenarios for the Bayesian Bernoulli-Exponential broadly suggested no pertinent issues with the convergence or stationarity of MCMC chains for estimated parameters irrespective of prior specified, sample size or visit schemes.Fixing sample sizes and priors across selected scenarios of the model and examining effect of varying sequences on parameter estimates, a consistent trend in magnitude and direction of the estimates and their transformations were observed.
As sample sizes increased, albeit keeping priors and time sequences constant, credible interval widths were increasingly narrow, indicating that when the proposed model is applied to datasets of increasing sample sizes, resulting estimates are obtained with higher precision.Overall, evaluation made for the Bayesian Bernoulli-Exponential model indicated better performance for the less intense visit sequence 5&3 scenario, reflected by lower DIC values, followed by the balanced visit sequence 10 regardless of sample size or prior type.Sample sizes across various simulation scenarios performed similarly well, only that the difference in performance was largely attributable to the sequence of individual visits.Finally, the proposed model has been applied to a bladder cancer recurrence data to serve as an application example.

Fig. 8
Fig. 8 DIC plot for keeping prior fixed at jeffreys non-informative and examining influence across sample size and design schemes levels, and two continuous variables.The longitudinal responses are simulated from a Bernoulli distribution.The first response is simulated from the distribution, and then the subsequent response is computed based on the relationship between the prior outcome and the prior time for predicting the average response based on starting parameter values in Table1.It is important to clarify, however that during the simulation exercise, only "plausible" starting values from the range of starting values in Table1are utilized.It is not the intent of this study to analyze the impact of all four range of starting values.

Table 1
Parameter initial value scheme for simulations

Table 2
Simulation design scheme a test statistic to accept or reject the null hypothesis that the Markov chains are from a stationary distribution.The half-width test is based on a computed 95% confidence interval for the mean, using the chain that earlier passed the stationarity test.The resulting ratio of the interval half-width and the mean compared with a threshold ( ε = 0.1 ) determines whether the half-width test is passed or not.More precisely, the test passes if the ratio between the half-width and the mean is lesser than ε .Selected convergence results based on the Heidelberger and Welch test are presented for the Bayesian Bernoulli-Exponential joint model across select scenarios and shown in the Tables 3, 4 and 5.
or visit schemes were statistically insignificant.This suggests that the sampled values for parameters are from a stationary process.A further indication is that our model parameter estimation can be implemented with precision because MCMC chains are in a stationary distribution.

Table 3
Heidelberger and welch test for the Bayesian Bernoulli-Exponential model and for the Gaussian informative prior

Table 5
Heidelberger and welch test for the Bayesian Bernoulli-Exponential model and for the Jeffreys non-informative prior

Table 6
Table of parameter estimates for the Bayesian Bernoulli-Exponential joint model and for the Gaussian informative prior scheme 5&3 , the model performs better overall for sample size 18 and 54 regardless of prior chosen.No marked differences are observed however when the Jeffreys prior is used for sample size 18 and 54 as evidenced by Fig.5.Furthermore, model performance does not broadly

Table 7
Table of parameter estimates for the Bayesian Bernoulli-Exponential joint model and for the Gaussian non-informative prior scheme

Table 8
Table of parameter estimates for the Bayesian Bernoulli-Exponential joint model and for the Jeffreys non-informative prior scheme

Table 9
Credible interval widths for selected scenarios for the Bernoulli-Exponential model

Table 10
Table of parameter estimates for the Bayesian Bernoulli-Exponential joint model and for the Jeffreys non-informative prior scheme

Table 11
Credible interval widths for selected scenarios for the Bernoulli-Exponential model

Table 12
The bladder cancer data (called bladder) in R software

Table 13
Heidelberger and welch test for the Bayesian Bernoulli-Exponential model for the bladder cancer data including three prior scenarios

Table 14
Results of the Bayesian Bernoulli-Exponential model applied to the bladder cancer data with different prior scenarios considered